Toponym Disambiguation Using Ontology-Based Semantic Similarity
Abstract
We propose a new heuristic for toponym sense disambiguation, to be used when mapping toponyms in text to ontology concepts, using techniques based on semantic similarity measures. We evaluated the proposed approach using a collection of Portuguese news articles from which the geographic entity names were extracted and then manually mapped to concepts in a geospatial ontology covering the territory of Portugal. The results suggest that using semantic similarity to disambiguate toponyms in text produces good results, in comparison with a baseline method.
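The general idea of picking a toponym sense by semantic similarity can be sketched as follows. This is only an illustrative sketch, not the heuristic proposed in the paper: the abstract does not say which similarity measure or ontology structure is used, so the example assumes a Wu-Palmer-style measure over a toy part-of hierarchy. All concept names (`DistrictA`, `Vila Nova (A)`, and so on) and the helper functions are hypothetical.

```python
from typing import Dict, List, Optional

# Toy part-of hierarchy (child -> parent). Every name here is hypothetical
# and stands in for concepts of a real geospatial ontology.
PARENT: Dict[str, Optional[str]] = {
    "Country": None,
    "RegionNorth": "Country",
    "RegionSouth": "Country",
    "DistrictA": "RegionNorth",
    "DistrictB": "RegionSouth",
    "Vila Nova (A)": "DistrictA",
    "Vila Nova (B)": "DistrictB",
    "Riverside (A)": "DistrictA",
}


def ancestors(concept: str) -> List[str]:
    """Return the path from the concept up to the root, concept first."""
    path: List[str] = []
    node: Optional[str] = concept
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def depth(concept: str) -> int:
    """Depth in the hierarchy, counting the root as depth 1."""
    return len(ancestors(concept))


def wu_palmer(a: str, b: str) -> float:
    """Wu-Palmer-style similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    ancestors_of_b = set(ancestors(b))
    # The first shared node on a's upward path is the deepest common ancestor.
    lcs = next(node for node in ancestors(a) if node in ancestors_of_b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def disambiguate(candidates: List[str], context: List[str]) -> str:
    """Pick the candidate concept most similar, in total, to the context concepts."""
    return max(candidates, key=lambda c: sum(wu_palmer(c, k) for k in context))


if __name__ == "__main__":
    senses = ["Vila Nova (A)", "Vila Nova (B)"]
    # Another place name in the same text has already been resolved.
    print(disambiguate(senses, context=["Riverside (A)"]))
```

In this sketch, an ambiguous name is resolved to the sense that lies closest in the hierarchy to the other places mentioned in the same text. A real system would replace the toy dictionary with the ontology's own relations and might weight context concepts differently.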
Similar resources
Semantic Similarities between Locations based on Ontology
Toponym disambiguation or location names resolution is a critical task in unstructured text, articles or documents. Our research explores how to link ambiguous locations mentioned in documents, news and articles with latitude/longitude coordinates. We designed an evaluation system for toponym disambiguation based on annotated GEOCLEF data. We implemented a node-based approach taking population ...
Toponym Disambiguation in English-Lithuanian SMT System with Spatial Knowledge
This paper presents an innovative research resulting in the English-Lithuanian statistical factored phrase-based machine translation system with a spatial ontology. The system is based on the Moses toolkit and is enriched with semantic knowledge inferred from the spatial ontology. The ontology was developed on the basis of the GeoNames database (more than 15 000 toponyms), implemented in the we...
Toponym Extraction and Disambiguation Enhancement using Loops of Feedback
Toponym extraction and disambiguation have received much attention in recent years. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. This paper addresses two problems with toponym extraction and disambiguation. First, almost no existing works examine the extraction and disambiguation interdependency. Second, existing disambiguation...
Un Sistema de Extracción de Información Basado en Ontologías para Documentos en el Dominio de las Tecnologías de Información / An Ontology-Based Information Extractor for Data-Rich Documents in the Information Technology Domain
This paper presents an information extraction method, suitable for data-rich documents, based on the knowledge represented in a domain ontology. The extractor combines a fuzzy string matcher and a word sense disambiguation (WSD) algorithm. The fuzzy string matcher finds mentions of terms combining character-level and token-level similarity measures dealing with non-standardized acronyms and inc...
Early ontological word-sense-disambiguation prototype
Semantic similarity and relatedness between concepts have been extensively studied in different areas ranging from psychology to computational linguistics. In this paper we address the problem of determining the similarity between concepts defined in a knowledge source such as an ontology. We propose a concept similarity algorithm based on geometric models for representing concepts and...